Local Visual Microphones: Improved Sound Extraction from Silent Video

نویسندگان

  • Mohammad Amin Shabani
  • Laleh Samadfam
  • Mohammad Amin Sadeghi
چکیده

Sound waves cause small vibrations in nearby objects. A few techniques exist in the literature that can extract sound from video. In this paper we study local vibration patterns at different image locations. We show that different locations in the image vibrate differently. We carefully aggregate local vibrations and produce a sound quality that improves state-of-the-art. We show that local vibrations could have a time delay because sound waves take time to travel through the air. We use this phenomenon to estimate sound direction. We also present a novel algorithm that speeds up sound extraction by two to three orders of magnitude and reaches real-time performance in a 20KHz video. Figure 1: Left: Input high-speed video, a: Spectrogram of original sound. b: Spectrogram of our recovered sound. c: Spectrogram of sound recovery by [10].

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Design of Acoustic Security System in Near Field based on Paired Microphones and Automatic Video Camera

Many conventional security systems use visual information from a video camera. However, these systems may not be able to acquire the important scenes in the blind area of the video camera. Acoustic security systems can support conventional visual security systems with acoustic events detection. In our research, we focused on acoustic security systems in the near field, and we designed a prototy...

متن کامل

Non - Speech Acoustic Event Detection Using

Non-speech acoustic event detection (AED) aims to recognize events that are relevant to human activities associated with audio information. Much previous research has been focused on restricted highlight events, and highly relied on ad-hoc detectors for these events. This thesis focuses on using multimodal data in order to make non-speech acoustic event detection and classification tasks more r...

متن کامل

Seeing Through Noise: Speaker Separation and Enhancement using Visually-derived Speech

When video is recorded in a studio, sound is clear of external noises and unrelated sounds. However, most video is not shot at studios. Voice of people shot in family events is mixed with music and with other voices. Video conferences from home or office are often disturbed by other people, ringing phones, or barking dogs. TV reporting from city streets is mixed with traffic noise, sound of win...

متن کامل

Comparison of Time-Frequency Feature Extraction Techniques for Environmental Sound Recognition

This paper is the continuation of previously published work in which we have been analysing different methods – traditionally used in speech recognition – for their suitability to be applied to Environmental Sound Recognition. While current research devotes much effort to speech and speaker recognition, Environmental Sound Recognition is an area where little research has been reported. Despite ...

متن کامل

The visual microphone: Passive recovery of sound from video Citation

When sound hits an object, it causes small vibrations of the object’s surface. We show how, using only high-speed video of the object, we can extract those minute vibrations and partially recover the sound that produced them, allowing us to turn everyday objects—a glass of water, a potted plant, a box of tissues, or a bag of chips—into visual microphones. We recover sounds from highspeed footag...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1801.09436  شماره 

صفحات  -

تاریخ انتشار 2018